Speech-based gender recognition using linear prediction and mel-frequency cepstral coefficients
نویسندگان
چکیده
Gender discrimination and awareness are essentially practiced in social, education, workplace, economic sectors across the globe. A person manifests this attribute naturally gait, body gesture, facial, including speech. For that reason, automatic gender recognition (AGR) has become an interesting sub-topic speech systems can be found many technology applications. However, retrieving salient gender-related information from a signal is challenging problem since contains abundant apart gender. The paper intends to compare performance of human vocal tract-based model i.e., linear prediction coefficients (LPC) auditory-based Mel-frequency cepstral (MFCC) which popularly used other tasks by experimentation optimal feature parameters classifier’s parameters. audio data study was obtained 93 speakers uttering selected words with different vowels. two vectors were tested using classification algorithms namely, discriminant analysis (DA) artificial neural network (ANN). Although experimental results promising both parameters, best overall accuracy rate 97.07% recorded MFCC-ANN techniques almost equal for male female classes.
منابع مشابه
The Capacity of Mel Frequency Cepstral Coefficients for Speech Recognition
Speech recognition is of an important contribution in promoting new technologies in human computer interaction. Today, there is a growing need to employ speech technology in daily life and business activities. However, speech recognition is a challenging task that requires different stages before obtaining the desired output. Among automatic speech recognition (ASR) components is the feature ex...
متن کاملNeuro Based Approach for Speech Recognition by Using Mel-frequency Cepstral Coefficients
NEURO BASED APPROACH FOR SPEECH RECOGNITION BY USING MEL-FREQUENCY CEPSTRAL COEFFICIENTS R.L.K. Venkateswarlu1 and R. Vasanthakumari2 1 Department of Information Technology, Sasi Institute of Technology and Engineering, Tadepalligudem, India, E-mail: [email protected]. 2 Perunthalaivar Kamarajar Arts College, Puducherry-605107, India, E-mail: [email protected]. This paper presents continu...
متن کاملSpeech reconstruction from mel frequency cepstral coefficients and pitch frequency
This paper presents a novel low complexity, frequency domain algorithm for reconstruction of speech from the melfrequency cepstral coe cients (MFCC), commonly used by speech recognition systems, and the pitch frequency values. The reconstruction technique is based on the sinusoidal speech representation. A set of sine-wave frequencies is derived using the pitch frequency and voicing decisions, ...
متن کاملAnalysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures.
The aim of this work is to develop methods that enable acoustic speech features to be predicted from mel-frequency cepstral coefficient (MFCC) vectors as may be encountered in distributed speech recognition architectures. The work begins with a detailed analysis of the multiple correlation between acoustic speech features and MFCC vectors. This confirms the existence of correlation, which is fo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Indonesian Journal of Electrical Engineering and Computer Science
سال: 2022
ISSN: ['2502-4752', '2502-4760']
DOI: https://doi.org/10.11591/ijeecs.v28.i2.pp753-761